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Means for identifying Neisseria meningitidis-specific genes. 



The invention relates to means for identifying genes specific to Neisseria 
meningitidis (Nm in abstract). It also relates to these genes and their 
5 biological applications. 

Nm is a strictly human bacteria which does not survive in the external 
environment. It's only known reservoir is the nasopharynx of humans. In 
certain circumstances which are still little understood, this bacteria will leave 
the nasopharynx, infiltrate the blood in circulation and cause septicaemia 

10 and/or meningitides. The existence of a meningitis suggests that the bacteria 
crosses the blood-brain barrier, one of the most difficult barriers to cross in the 
organism. Neisseria meningitidis is a bacteria having extracellular 
multiplication, in other words its dissemination in vivo is accompanied by a 
multiplication in the interstitial area. Very few bacteria having extracellular 

15 multiplication are capable of crossing the blood-brain barrier after the neonatal 
period, they are essentially Streptococcus pneumoniae, Haemophilus 
influenzae and Neisseria meningitidis. This property thus suggests specific 
attributes which allow these microorganisms to cross this barrier. 

Neisseria meningitidis presents two specificities for a bacteria having 

20 extracellular multiplication: 

(i) It is responsible for substantial bacteremia with a high number of 
bacteria in the blood. Thus, the comparison, in an animal model using the 
new-born rat, of the level of bacteremia induced by the injection of the same 
number of bacteria belonging to two different species (Neisseria meningitidis 

25 and Klebsiella pneumoniae) shows that N. meningitidis induces a bacteremia 
which can be 50-100 times greater than that induced by K. pneumoniae. This 
underlines the perfect adaptation of N. meningitidis to growth in the extra- 
cellular area. Certain bacterial attributes have already been identified as 
participating in this extracellular growth. These are essentially the 

30 polysaccharidic capsule, the lipooligosaccharide and the iron capture 
systems. The two first attributes allow resistance to the complement and to 
phagocytosis by the granulocytes and the third attribute allows the bacteria to 
obtain the iron essential for its growth. 
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(ii) The second particularity of N. meningitidis is related to its ability to 
cross the blood-brain barrier. This property results from an interaction with 
the cerebral endothelial cells. Until now the only bacterial attribute identified 
as being involved in the interaction of N. meningitidis at the cerebral 
5 endothelium level are the type IV pili. A molecule which is one of these pili 
called PilC, involved in this interaction, is the adhesin of the pili. 

The inventors work has concerned the search for means allowing 
identification of the genes of Nm which are capable of growing specifically in 
serum and of crossing the blood-brain barrier. 
10 The application to Nm of the technique described by Pelicic et al, 2000 for 
building a bank of mutants allowed mutagenization of more than 70% of the 
mutagenizable and thus non essential genes. 

This tool has proved to be particularly valuable for detecting in an 
exhaustive fashion all of the mutants for a given phenotype, for example those 
15 which are important for growth in the serum, and for identifying adhesins 
which are important for interaction with the endothelial cells and thus the 
crossing of the blood-brain barrier and this is without necessarily testing the 
mutants individually for this phenotype. 

Therefore the invention relates to the use of such a bank for detecting 
20 genes of Nm expressing a particular phenotype. 

It also relates to the genes involved in such a phenotype. 

The invention also relates to the exploitation of the thus-identified genes as 
Nm anti-pathogenicity targets. 

It also relates to the use of the genes coding for adhesins to allow 
25 therapeutic ingredient to pass through the blood-brain barrier. 

The invention moreover relates to the essential genes of N. meningitidis, 
and their homologues in other bacterial species and their use as targets for 
developing antibiotics. 

According to the invention, genes of pathogenic bacteria, in particular of 
30 Nm, are detected, expressing a desired phenotype, according to a method 
characterized in that : 

- a bank of mutants generated from a given bacterial strain is used so that 
at least 70 % of the non-essential genes, and in particular at least 80 %, or 
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even more than 90 %, are mutagen ized by inserting a transposon in a reading 
frame, 

- the mutants are then brought into contact, either individually, or in pools, 
with an environment, such as a medium, an animal or cells, capable of 

5 interacting with the mutant bacteria expressing the desired phenotype, 

- when pools are used, the bacteria which have not reacted with the 
desired phenotype are recovered, 

- the mutated genes of these bacteria are identified and their involvement 
in said phenotype is verified. 

10 The bank of mutants is advantageously generated according to the 
method described by Pelicic et al. above. 

The contact stage is carried out by passing on serum or an animal model 
in vivo or cells which are able to react with the bacteria expressing the desired 
phenotype and, when pools of mutants are used, the bacteria which have not 

15 reacted with the desired phenotype are recovered. 

In order to identify the mutated genes of these bacteria and to verify their 
involvement in said phenotype, the mutants are organized into pools. For 
each mutant, the insertion sites are amplified using appropriate 
oligonucleotides. The amplification products are placed on a membrane made 

20 for example of nylon. The pools of mutants are placed under the conditions for 
which mutants are sought. Total DNA is prepared using bacteria obtained 
from each output pool and an amplification is carried out using 
oligonucleotides which served to amplify the insertion sites in the mutants of 
the pool. The amplification product then serves to hybridize the membranes 

25 which correspond to each pool. The mutants for which no amplification is 
detected are mutants for the phenotype considered. It will be observed that 
this technique allows the mutants in question to be retained, allowing each 
mutation to be retransformed in order to confirm the phenotype. 

The invention also relates to the genes which give a bacteria the ability to 

30 grow or to react with a given environment such as serum, an animal model in 
vivo, cells. 

These genes are characterized in that they can be obtained by the method 
defined above. 
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In particular the invention relates to the genes involved in the growth of 
bacteria in serum, chosen from the genes of Figure 3, identified with respect 
to the number of the pool of mutants of Figure 2. 

Quite particularly the invention relates to the genes Nm 83 dxr, Nm 229, 
5 Nm 356, Nm 848 galU, and the genes of serogroup B, Nm 1771 and NmB 65 
as novel products. 

The invention also relates to the application of the genes selected in 
relation to the growth phenotype in serum, as anti-pathogenicity targets, which 
consists of inhibiting the growth of Nm in vivo in serum. 
10 Other genes of great interest according to the invention are characterized 
in this that they are involved in the interaction with endothelial cells. 

Therefore the invention also relates to the application of these genes to 
allow the opening of the blood-brain barrier to therapeutic ingredients, such as 
medicaments for Parkinson's Disease, Alzheimer's disease, antimitotics, 
15 medicaments for multiple sclerosis, antivirals, antimycotics and antibiotics. 

Moreover the invention relates to the essential genes of Nm for which no 
mutant is present in the bank and the application of these genes as targets for 
developing antibiotics. 

Other characteristics and advantages of the invention are given in the 
20 examples which follow and with reference to Figures 1 to 3 which represent: 

- Figure 1, the list of genes presenting in the 2 sequenced strains of Nm 
more than 65% similarity on a proteinic base, 

- Figure 2A, the list of genes for which there exists a mutant in the bank 
and Figure 2B the list of mutants classified into 96 pools of 48 mutants, and 

25 - Figure 3, the list of the mutants altered in growth in the serum. 

• Construction of a bank of mutants of Nm 8013 

1. A bank of mutants is built from the A/, meningitidis 8013 strain of 
serogroup C, operating according to the technique described by Pelicic et al, 
30 Journal of Bacteriology, 2000, 182: 5391-5398. A sequenced bank of 4547 
mutants is obtained. 
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Statistically 80% of the insertions are in open reading frames since it 
concerns the % of coding regions in the genome of the 2 sequenced strains, 
namely Z2491, strain of serogroup A sequenced by the Sanger Center, and 
MC58, strain of serogroup B sequenced by TIGR. Therefore there are 
5 approximately 3600 mutants in open reading frames and in most cases, 
several insertions per gene. Taking into account the size of the genome, the 
mutagenesis thus concerns 93% of the mutagenizable genes. 

The statistic formula allowing calculation of the probability (P) that a gene 
is mutated is the following: 
10 P=1-e n/p 

n: number of mutants in the bank 
p: number of mutagenizable (non-essential) genes 
The second number can only be estimated. But according to studies of 
bacteria better characterized than Neisseria meningitidis, it is reasonable to 
15 estimate that 350 genes are essential to the survival of the bacteria. 
Consequently, there would be 1750 non-essential genes in the 
meningococcus, 92.6 % (1619) of which should be mutated in the bank. 

2. All of the insertions of this bank are sequenced according to the 
20 technique used for the sequencing of insertions, already described and 
published (Prod'hom et al. 1998. FEMS Microbiol Lett. 1858: 75-81). This 
technique uses a specific primer for the known sequence, in this case the 
transposon, and a second specific primer of a synthetic linker ligated to the 
reduced genomic DNA. The use of AmpliTaq Gold polymerase Perkin-Elmer 
25 is important for minimizing a non-specific hybridization of the primers. 

• Determination of the essential genes. 

An essential gene can be present only in a single strain. Any gene present 
in the two strains, the genome of which has been sequenced and for which a 
30 mutant does not exist in the bank of the invention is thus considered to be 
essential. 

The genes present in the two strains are given in Figure 1. The 
nomenclature used is that of the strain Z2491 (sequenced by Sanger). The 



1st filing 

6 

list given in Figure 1 was obtained by performing a TblastN of each reading 
frame of Z2491 in MC58, then keeping all the frames of Z2491 which had a 
homology percentage greater than 65 %. 

The list of the genes for which a mutant is present in the bank is 
5 represented in Figure 2A. The list of differential genes, i.e. those present in 
Figure 1 and not in Figure 2A, is high in essential genes. This list of 
differential genes includes genes which are homologous in other Gram- 
negative pathogenic bacteria, such as enterobacteria, Pseudomonas, 
Acinetobacter, or even certain Gram-positive bacteria. These genes 
10 constitute targets for developing broad spectrum antibiotics against these 
Gram-negative bacteria and broader spectrum antibiotics when these genes 
are homologous to certain genes of Gram-positive bacteria. 

• Screening of the bank for different phenotypes. 

15 For the screening, knowledge of the sequence of each insertion is applied. 
For this, the mutants are organized into pools of 48. For each mutant, the 
insertion sites are amplified using suitable oligonucleotides. Each 
amplification product is deposited on a nylon membrane. The pools of 48 
mutants are then placed in the conditions for which mutants are sought. The 

20 total DNA is prepared using bacteria obtained from each output pool and an 
amplification is carried out using oligonucleotides which served to amplify the 
48 insertion sites. The amplification product then serves to hybridize the 
membranes which correspond to each pool. The mutants for which no 
amplification is detected are mutants for the phenotype considered. 

25 

• Search for mutants» important for growth in serum 

As mentioned above, N. meningitidis is a bacteria having extracellular 
multiplication perfectly suited to this compartment. The invention therefore 
relates to identifying in an exhaustive way the attributes and the genes 
30 required for this growth. 



1 - Isolation of the strains: 
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The wild strain 2C43 wt (positive control) and Z5463 CPS- (non- 
capsulated strain, negative control) are isolated on a GCB plate (agar 5g/l); 
the mutants produced from the strain 8013 are isolated on a GCB plate + 
Kanamycin 100 pg/pl. 

5 

The culture is carried out over 14 - 18 hours, at 37°C, in 5% C0 2 . 

2 - Serum: 

The complemented human serum is stored at -80°C. After heating for 30 
10 min. at 56°C, the serum is decomplemented. Growth is produced for the 
controls and the mutants with systematically complemented and 
decomplemented serum. 

Each mutant is tested with a positive and a negative control to compare 
15 the growth curves produced on different days. 

3 - Inoculum: 

1 dose of well-isolated colonies is collected and dissociated in 5 ml of 
RPMI (GIBCO: RPMI 1640 medium with glutamax I; previously placed for 5- 

20 10 min. at ambient temperature before inoculation, to protect the bacteria from 
rapid variations of temperature). The mass of bacteria is taken up using a 
P1000, then vortexed. The preculture is subjected to stirring at 37 °C for 2 
hours. The OD is then measured at 600 nm (the white control being RMPI) 
and the inoculum is returned to 0.1 in RPMI (previously placed for 5-10 min. at 

25 ambient temperature). 

4 - Growth medium: 

98 pi of serum and 292 pi of RPMI (25% serum, 75% RMPI) is deposited 
per well and left for 5 min. at ambient temperature before inoculation. 

30 

400 pi of water is introduced into the optionally empty wells. 



5 - Inoculation: 
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After stirring, 10 pi of inoculum adjusted to 0.1 of OD is collected, and it is 
deposited in a well containing growth medium, then mixed using a P1000. 
The wells are placed in an oven at 37°C, in 5% C0 2 . The inoculum is 
analyzed at TO and the bacterial growth at various times, by plating 50 pi of 
5 different dilutions on GCB plates. 

6 - Sampling: 

Suspension takes place again (with a P1000) before sampling at 0 hour, 1 
hour, 5 hours post inoculation. 20 (jl of inoculated culture medium is taken 
10 which is placed in 180 pi of RPMI (D1; tube 1.5 ml, previously placed at 
ambient temperature for 10 min., before sampling, in order to avoid a large 
difference in temperature). The mixture is vortexed. 

7 - Dilutions: 

15 The tube Dl is vortexed, then 50 pi of D1 is sampled which is added to 450 
pi of RPMI (D2; 2 ml tube, previously placed at ambient temperature for 10 
min.). Between each dilution stage vortexing is carried out and the cone is 
changed. Dilutions are carried out up to the dilution D4 for the time TO, D3 for 
the time T1 , and D5 for the time T5. 

20 

8 - Inoculation: 

The inoculation takes place on a GCB plate for the controls, and GCB+ 
kanamycin 100 pg/pl for the mutants. Vortexing is carried out, then 50 pi is 
taken from each dilution, followed by incubating upside down in an oven at 
25 37°C, in 5% C02, for 14-18 hours, before counting the colonies. D4, 3 are 
inoculated for the time TO; DO, 1, 2, 3 for the time T1; D5, 4, 3, 2, 1 for the 
time T5. 

9 - Counting of the number of colonies (CFI) 
30 The growth curves (log 10 CFU as a function of incubation time in hours) 
are drawn up. 



• Identification of the adhesins for endothelial cells 
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The important adhesins for interaction on endothelial cells can be used to 
allow the opening of the blood-brain barrier and to allow medicaments to pass 
into the brain. 



HUVEC cells at confluence are inoculated in 24-well cell culture 
microplates at a density of 10 5 /well. The cells are washed the following day in 
10% serum/RPMI, and are incubated for 2 hours at 37°C. At the same time, 
the bacteria are resuspended in the same medium at an OD 550 of 0.1 to 0.01 
and incubated for 2 hours at 37°C. The suspension of bacteria is used to 
infect the cells for 30 min at 37°C. 

The infection then continues for 4 - 5 hours with the cells being washed 
each hour. 
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CLAIMS 

1/ Exhaustive method for detecting pathogenic bacteria genes, in particular 
Nm, expressing a desired phenotype, characterized in that : 

- a bank of mutants generated from a given bacterial strain is used so that 
5 at least 70% of the non-essential genes, and in particular at least 80%, or 

even more than 90%, are mutagenized by inserting a transposon in a reading 
frame, 

- the mutants are then brought into contact, either individually, or in pools, 
with an environment, such as a medium, an animal or cells, capable of 

10 interacting with the mutant bacteria expressing the desired phenotype, 

- when pools are used, the bacteria not having reacted with the desired 
phenotype are recovered, 

- the mutated genes of these bacteria are identified and their involvement 
in said phenotype is verified. 

15 21 Method according to claim 1, characterized in that, in the contact stage, 
the mutants of the bank are passed through serum. 

3/ Method according to claim 1, characterized in that, in the contact stage, 
the mutants of the bank are passed over endothelial cells. 

4/ Genes which give a bacteria the ability to grow or to interact with a given 
20 environment, such as serum, an in vivo animal model, cells, characterized in 
that they can be obtained by the method according to any one of claims 1 to 
3. 

5/ Genes according to claim 4, characterized in that they are involved in 

the growth of the bacteria in serum and are chosen from those in Figure 3. 
25 6/ Genes according to claim 5, characterized in that they are chosen from 

Nm 83 dxr, Nm 229, Nm 356, Nm 848 galU, and the genes of serogroup B, 

Nm B 1771 and Nm B 65. 

II Genes according to claim 4, characterized in that they are involved in 

the interaction with endothelial cells. 
30 8/ Application of the genes selected according to the method of claim 2, or 

according to claim 5 or 6, as anti-pathogenicity targets, which consists in 

inhibiting Nm growth in vivo in the serum. 
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CLAIMS 

1/ Exhaustive method for detecting pathogenic bacteria genes, in particular 
Nm, expressing a desired phenotype, characterized in that : 

- a bank of mutants generated from a given bacterial strain is used so that 
5 at least 70% of the non-essential genes, and in particular at least 80%, or 

even more than 90%, are mutagenized by inserting a transposon in a reading 
frame, 

- the mutants are then brought into contact, either individually, or in pools, 
with an environment, such as a medium, an animal or cells, capable of 

10 interacting with the mutant bacteria expressing the desired phenotype, 

- when pools are used, the bacteria not having reacted with the desired 
phenotype are recovered, 

- the mutated genes of these bacteria are identified and their involvement 
in said phenotype is verified. 

15 21 Method according to claim 1, characterized in that, in the contact stage, 
the mutants of the bank are passed through serum. 

3/ Method according to claim 1, characterized in that, in the contact stage, 
the mutants of the bank are passed over endothelial cells. 

4/ Nm genes, characterized in that they are chosen from dxr (1-deoxy-D- 
20 xylulose-5-phosphate reductoisomerase), galU (UTP-glucose-1 -phosphate 
uridylyltransferase), and the gene of serotype B hisC (histidinol-phosphate 
amino transferase). 

5/ Application of the genes selected according to the method of claim 2, or 
according to claim 4, as anti-pathogenicity targets, for the screening and 
25 manufacture of inhibitors of Nm growth in the serum. 

6/ Application of the genes selected according to the method of claim 3, or 
according to claim 4, for the screening and manufacture of medicaments 
allowing the opening of the blood-brain barrier to therapeutic ingredients such 
as medicaments for Parkinson's Disease, Alzheimer's disease, antimitotics, 
30 medicaments for multiple sclerosis, antivirals, antimycotics and antibiotics. 

7/ Application according to one of claims 5 or 6, characterized in that 
essential genes of Nm are used as targets for the screening and manufacture 
of antibiotics. 
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9/ Application of the genes selected according to the method of claim 3, or 
according to claim 7, to allow the opening of the blood-brain barrier to 
therapeutic ingredients such as medicaments for Parkinson's Disease, 
Alzheimer's disease, antimitotics, medicaments for multiple sclerosis, 
antivirals, antimycotics and antibiotics. 

10/ Application of the essential genes of Nm as targets for developing 
antibiotics. 
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515 Genes mutated on 29/11/01 



Figure 2 A 



The mutants in red present an increased sensitivity to the bactericidal action of the 
complement of human serum. 

Nm2: nuoL, NADH dehydrogenase ! chain L 8/1 
Nm3: unknown 35/25 

NmlO: nuoG, NADH dehydrogenase I chain G 16/33, 26/39 

Nml5: nuoE, NADH dehydrogenase I chain E 8/41 

Nml7: mtoC, NADH dehydrogenase I chain C 29/44, 57/25 

Nml8: nuoB, NADH dehydrogenase I chain B 7/3 9 \ 64/45' 

Nm20: probable integral membrane protein 20/44 

Nm22: possible transposase for FS/0/d 9/17 

Nm24: unknown 20/7 

Nm29: unknown 93/8 

Nm37: unknown 95/37 

Nm39: unknown 5/31, 24/16', 53/27', 93/13 

Nrn42: pyrD, dihydroorotate dehydrogenase 79/39 

Nm47: probable transmembrane transport protein 71/19 

Nm48: pglA, giycosyltransferase 18/48 

Nm50: katA, catalase 60/22 

Nm60: probable transmembrane transport protein 4/37 1 , 9/37, 80/37' 
Nm65: probable lipoprotein 19/44 
Nm67: unknown 49/30 

Nm70: rlvC, probable ribosomal large subunit pseudouridine synthase C 1/44 
Nm73: probable amino-acid transport protein 14/46, 17/12 
Nm77: gidB y probable glucose inhibited division protein B homolog 20/24 
Nm83: dxr 9 probable ! -deoxy-D-\y!uioseo-phosphate rcductoisomerase 80/24 
Nm84: probable integral membrane protein 5/38 

Nm 1 55 : "unknown 7/3 7? 13/31', 25/45' " ~ 

Nml57: unknown 47/48 

Nrn 1 60: probable two-component sensor 92/36 
Nm!62: SUN homolog 19/43 

Nm 1 65: probable periplasmic protein 8/7, 8/15, 95/23 
Nml73: unknown 51/35 
NmlS2: probable lipoprotein 9/42 

Nml83: gltS, sodium/giutamate symport carrier protein 72/40 
Nm 1 84: unknown 19/1, 23/8, 57/10 

Nrn 185: h'pB, capsule polysaccharide modification protein 19/14', 22/14', 25/9, 86/2 
Nml86: iipA. capsule polysaccharide modification protein 41/29, 80/4, 93/19 
NmI93: probable modification methylase 15/20, 25/6 

N m 1 95 " ttii.K \yxhdrk. cajole j x jKsacd sstuc exfxxt A" i 1 > Ayix -ing pi^ea i 1 7/5 1 , 18/15', 93/4 7 
NinlOiS; ar.l. capsule polysaccharide export outer membrane protein 13/16, 43/8 



Nm207: probable integral membrane protein 55/35 

Nm221 : possible transposase for ]$H06 12/5 

Nm225: possible lipoprotein 5/36, 15/9 

Nm22:S: unknown 28/25 

Nm229: unknown 93/38 

Nm233: probable lij)oprutcin 16/20 

N*m247: mutS, ON A mismatch repair protein M/7, 1 1/16 

Nm251: probable periplasmic protein 1/21, J2/7\ 12/17", 12/19', 19/13", 94/38 
N f rn255: ftsX, probable ARC transporter inlegral membrane protein 16/5, 93/43 
Nm259: probable transmembrane transport protein 92/1 I 
N ni274: possible glycerate dehydrogenase 39/2(1'. 47/12' 

Nm279: probable membrane-bound lytic murom !r,m.> glycosylate 6/29, 70/13, 76/7, 95/47 
Nm28:>: probable lipoprotein 38/2 

Nm2X7: ihpA, probable thiamin-binding periplasm ic protein 1/38 
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Figure 2A 



515 genes mutated on 29/11/01 

The underlined mutants present an increased sensitivity to the bactericidal action of 
the complement of human serum. The figures in bold identify each mutant by pool 
number/mutant number. 

Nm2: two!., NADH dehydrogenase I chain L 8/1 
Nm3: unknown 35/25 

NmlO: mtoG, NADH dehydrogenase 1 chain G 16/33, 26/39 

Nm3 5: mtofi, NADH dehydrogenase I chain E 8/41 

Nmi7: nvoC, NADH dehydrogenase I chain C 29/44, 57/25 

Nml8: nuoB, NADH dehydrogenase I chain B 7/39% 64/45' 

Nm20: probable integral membrane protein 20/44 

Nm22: possible transposase for \SI0J6 9/17 

Nm24: unknown 20/7 

Nm29: unknown 93/8 

Nm37: unknown 95/37 

Nm39: unknown 5/31, 24/16', 53/27% 93/13 

Nm42: pyrD, dihydroorotate dehydrogenase 79/39 

Nm47: probable transmembrane transport protein 71/19 

Nm48: pgtA, glycosy] transferase 18/48 

Nm50: kafA, catalase 60/22 

Nm60: probable transmembrane transport protein 4/37', 9/37, 80/37* 
Nm65: probable lipoprotein 19/44 
Nm67: unknown 49/30 

Nm70: rmC, probable ribosomal large subunit pseudouridine synthase C 1/44 
Nm73: probable amino-acid transport protein 14/46, 17/12 
Nm77: gidB, probable glucose inhibited division protein B homolog 20/24 
NmS3: dxr, probable l-deoxy-D-xylulo se-5- phosphate reductoisomerase 80/24 
Nm84: probable integral membrane protein 5/38 

Nml 55: unknown 7/37, 13/31% 25/45' 
Nmi 57: unknown 47/48 

Km 160: probable two-component sensor 92/36 
Nml 62: -SUN homolog 19/43 

Nml 65: probable periplasmic protein 8/7, 8/15, 95/23 

Nml 73: unknown 51/35 

Nmi 82: probable lipoprotein 9/42 

Nm 1 S3: gltS, sodium/glutamate symport carrier protein 72/40 
Nml 84: unknown 19/1, 23/8, 57/10 

NmI85: lipB. capsule pol ysaccharide modification protein 19 / 14V 22/14% 25 /9» 86/2 
Nml 86, //;;/% capsule poly saccharide modification protein 41/29. 80/4. 93/19 
Nml 93: probable modification methyiase 15/20, 25/6 

Nml95: ^Ij^Mfegm^ 17/5% 18 /15% 93/47 

Nm J 98; cfrA. capsule pol ysaccharide e xport outer membrane protein J 3/1 6 ; 43/8 

Nm207: probable integral membrane protein 55/35 

Nm221: possible transposase for \Si!06 12/5 

Nm225' possible lipoprotein 5/36, 15/9 

Nm228. unknown 28/25 

N m22 f i..n oknown 93/38 

Nm21»3" probable lipoprotein 16/20 

Nm247- /W///.V, JXNA mismatch repair protein 1 1/7, 1 1/16 

Nni251: probable peiiplasmic protein 1/2!, 12/7% 12/17'% 12/19% 19/13'% 94/38 
Nm2ro: fisX\ probable ABC transporter integral membrane protein 16/5, 93/43 
Nni259: probable transmembrane transport protein 92/! I 
Nm274: possible glyceric dehydrogeimse 39/20% 47/12' 

Nm27'J probable membrane-bound lytic muroin transglycosylase 6/29, 70/13, 76/7, 95/47 
Nm285. probable lipoprotein 38/2 
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Nm289: unknown 16/1 
Nm292: unknown 10/43 

Nm293: pilC? y pilus-associated protein 2/26, 94/28 
Nm298: probable symport protein 95/14 



Figure 2A 
(cntd) 



Nm307: unknown 12/28 

Nm3 i 9: mafB\ possible MafB alternative C -terminus 8/30 

Nm320: unknown 13/43 

Nm325: mafA, adhesin 5/25 

Nm348: putative nuclease 3/30, 20/4 

Nm349: probable integral membrane protein 46/7 

Nm353: probable integral membrane protein 22/45, 90/40 

Nm356: possible transferase 55/48 

Nrn365: probable permease 4/33 

Nm368: probable integral membrane protein 12/36 

Nm369: hemK, HemK protein 81/39 

Nm370: probable integral membrane protein 46/43 

Nm372: unknown 7/1 8\ 8/27' 

Nm374:ppc, phosphocnolpyruvate carboxylase 18/33, 18/38, 84/2 
Nm380: unknown 46/35 
Nm382: unknown 10/9 

Nm397: thiC, thiamin biosynthesis protein 13/33 



Nm409: probable amino-acid transport protein 4/24 _____ 

Nm413: gntP, probable gluconate permease 10/47 
Nm4l4: probable permease 2/16, 5/46 

N*m422: probable pseudouridine synthase 6/9\ 13/19, 20/1 5\ 77/IS 
Nm425: unknown 47/7 

Nm428: conserved hypothetical protein 3/31 
Nm433: hrpA\ ATP-dependent DNA helicase 44/4 
Nm434: hypothetical inner membrane protein 61/14 
Nm436: hypothetical inner membrane protein 93/32 
Nm44l: conserved hypothetical protein 12/25 

Nm445:/wrZ., phosphoribosylformyiglycinamidine .synthase 4/17, 10/14', 1 i/8\ 75/34 
Nm457: iga2 9 IgA-specific serine endopeptidase 9/21, 40/31 
Nm465: hypothetical inner membrane protein 7J/2 
Nm478: possible outer membrane peptidase 6/43 
Nm48 1 : unknown 27/2 1 

Nm485: probable ABC transport ATP-binding sub unit 5/4, 16/45 
Nm486: possible ABC transport inner-membrane subunit 15/32 
Nm489: hypothetical protein 61/13 

Nm490: possible periplasrnic/outer membrane protein 96/22 
Nm497: probable inner membrane protein 71/18 
Nm499: regG, possible regulator o\pilE expression 7/44 

Nm503: possible transglycosylase 6/19, li/17, 16/17" 

Nm506: possible lipoprotein 6/12 

Nm5)2: otpl : probable A TP synthase 1 8/23 

Nm524: ! :< tA, lacU>-N-neolotraose biosynihesi %^lyc«-;yl iranicniso 3/8, 10/2, 21/21, 45/34 
Nm53l: pseudogene, probable Iraiwpusase remnant 90/36 
Nm546: unknown 20/9 

Nm559: teuS, probable leucyl-fKNA synthetase 68/22 
Nm560. Jrg, probable type (1 rcslricrion eruJoniidca.se 57/15 
Nm580; argA, probable acctylgli!t;iin>!tc synthase 7/25 
Nm582:/>yr/i, probable orotate piiospiiorihysyliPinsferasc 19/36, 57/31 
Nm587://./a, probable fructose- 1 .6-bisphosphute .iklolase 2 1/25, 21/47 
Nm592: probable glut am ate- 1 -semialdehyde 2, 1 -c.uniiiomtUa.se 17/10 

Nni5 l M: orn, probable oligoribonuelease 14/25 

NIS159'/ 1 : qiteA, probable S-adcnasyli-iioiiiii»iiiiiu-ri;Kisylir;iii.srcni.se-isofncrasc 91/2, 96/23 



received 01/07/02 



Figure 2A 

Nm287: tbpA, probable thiamin-binding periplasmic protein 1/38 (cntd) 
Nm289: unknown 16/1 
Nm292: unknown 10/43 

Nm293:/>/7C2, pilus-associatcd protein 2/26, 94/28 
Nm298: probable symport protein 95/14 

Nm307: unknown 12/28 

Nm319: maf8\ possible MafB alternative C-terminus 8/30 

Nm320: unknown 13/43 

Nfm325: ma/A, adhcsin 5/25 

Nm348: putative nuclease 3/30, 20/4 

Nm349: probable integral membrane protein 46/7 

Nm353: probable integral membrane protein 22/45, 90/40 

Nm356: possible transferase-55/48 

Nm365: probable permease 4/33 

Nm368: probable integral membrane protein 12/36 

Nm369: hemK, HemK protein 81/39 

Nm370: probable integral membrane protein 46/43 

Nm372: unknown 7/18', 8/27 1 

Nm374: ppc, phosphoenol pyruvate carboxylase 18/33, 18/38, 84/2 
Nm380: unknown 46/35 
Nm382: unknown 10/9 

Nm397: MC> thiamin biosynthesis protein 13/33 

Nm409: probable amino-acid transport protein 4/24 
Nm413: gntP, probable gluconate permease 10/47 
Nm414: probable permease 2/16, 5/46 

Nm422: probable pseudouridine synthase 6/9% 13/19, 20/15% 77/18 

Nm425: unknown 47/7 

Nrn428: conserved hypothetical protein 3/31 

Nm433: hrpA\ ATP-dependent DNA helicase 44/4 

Nm434: hypothetical inner membrane protein 61/14 

Nm436: hypothetical inner membrane protein 93/32 

Nm441: conserved hypothetical protein 12/25 

Nm445:/w/£, phosphoribosylformylglycinaniidine synthase 4/17, 10/14% 1 1/8% 75/34 
Nm457: jga2 t IgA-specific serine endopeptidase 9/21, 40/31 
Nm465: hypothetical inner membrane protein 71/2 
Nm478- possible outer membrane peptidase 6/43 
Nm4S! : unknown 27/21 

Nm485: probable ABC transport ATP-binding subunit 5/4, 16/45 
Nm486: possible ABC transport inner-membrane subunit 15/32 
Nm489: hypothetical protein 61/13 

Nm490: possible pcriplasmic/outer membrane protein 96/22 
Nm407: probable inner membrane protein 71/18 
Nm499: rcgCJ, possible regulator of pilh) expression 7/44 

Nm503: possible iransglycosylase 6/19, 11/17, 16/17 

Nm506: possible lipoprotein 6/1 2 

Nm5i2: aipf, probable ATP synthase 1 8/23 

Nm53 j : psondogene, probable Iransposasc remnant 90/36 
Nm546: unknown 20/9 

Nm559: hutS. probable Icucyi-'ilNA synthetase 68/22 

Nm56(); probable type li restriction cndonuclcase 57/15 

Nm580: urg/i, probable acetyl glutamic synthase 7/25 

Nm582: pyrl\ probable oroiatc phosphorihosyllransi erase 19/36, 57/31 

Nni587 Jhes, probable fructose- i ,6- bisphosphatc aldolase 21/25, 21/47 

Nin592: hew!. y probable «Iiifamalc-l -somialdohydc 2, l-aminomufase 47/10 

Nm594. probable oligoribonucleasc 14/25 

N in 599. queA, probable S-adenosylniethionine-ribosyltransferase-isoincrase 91/2, 96/23 
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Nm600: unknown 11/28 

Nm60 1 : probable lysR-fan\\\y transcriptional regulator 47/37 
Nm602: carB, carbamoyl phosphate synthase 2/13, 94/39 
Nm6l3: probable marR-h\m\\y regulator 92/41 
Nm618: hypothetical protein 5/14, 91/41, 92/45 

Nm6 1 9: rx^fob lirxpoiys^ 1 

Nm629: unknown 9/6 

Nm630: unknown 10/4 

Nm63 I: unknown 6/46, 9/23, 15/22, 34/9 

Nm636: avtA, probable valine-pyruvate aminotransferase 7/22 

Nm638: pglC 7 pilin glycosylation protein 68/25 

Nm639: pglB, pilin glycosylation protein 90/45, 93/45 

Nm643: possible iipopolysaccharide biosynthesis translocase 43/13 

Nin646: unknown 7/16 

Nm650: pilQ, pilus secretin 10/33, 72/13 

Nm654: pilM, probable pilus assembly protein i 1/33, 19/33 

Nm670: possible two-component system sensor kinase 1/14 

Nm672: unknown 69/23 

Nm679: unknown 1/43, 15/17 

Nm687: probable periplasmic protein 96/46 

Nm688: j7?a£-like gene 3/24, 14/31, 16/2, 48/7, 87/6 

Nm690: unknown 10/21, 22/1 1 

Nm69I: unknown, Asn-rich N-terminus 46/46 

Nm692: unknown 4/7, 20/27 

Nm693: unknown within a region of low G+C % 3/19 
Nm694: unknown within a region of low Q vQ % 7/17 
Nm696: unknown 1/40, 7/42 



Nm714: probable transmembrane hexose transporter 2/14, 5/47 
Nm715: probable transmembrane transport protein 1/2, 6/45, 15/40 
Nm720: unknown 4/19, 14/7 

Nm722: probable transmembrane transport protein 41/25 
Nm725: adhA, probable alcohol dehydrogenase 7/20, 51/22 
Nm726: probable pilin (ac/P) 1 1/24 

Nm729: probable ABC transporter ATP-binding protein 92/43 
Nm730: possible thiol-disulphide interchange protein 7/9, 10/18 
Nm740: unknown 6/10 

Nm74l: possible ubiquinone biosynthesis protein 92/15 

Nm747: nqrF, probable Na"-transiocating NADH-ubiquinone reductase subunit F 90/34 
Nm750: nqrC, probable Na'-translocating NAD I f-ubiquinone reductase subunit C 12/45 
Nm752: /x^prd^ieK^-te 19/27,9609 
Nm753: possible membrane protein 7/46', 19/19, 28/21% 51/14, 95/45 
Nm755: unknown 70/ i J', 71/6', 73/5, -73/18 
Nm762: hypothetical protein 18/16, 90/29 
Nm7S6: possible membrane protein 78/23 
Nm796: unknown 92/2 

NmSOOTpossiW membrane protein 7/10, 29/28, 52/1, 69/14 

NmSOl: unknown 6/5% 12/8' 
NmS02: unknown 30/25 

Nm804: possible sec-independent proiein translocase component 93/25 

Nntf!l6: probable polyamine permease A TP-binding protein 19/8 

Nm8 1 7: probable polyamine permease inner membrane protein 9/36 

NmX 1 9: possible oxidoruductase 96/31 

Nm820: umiB. probable ammonium transporter 5/30 

Nm826: ppsA, probable phosphoeiioipyruvnle synthase 8/47 

Nm827: unknown 7/1 1 

NmS28: probable phosphatase 36/3 

Nm8>2: possible glycosyj irunsfciase 14/29, 48/26 

Nn>S-13: fbpfi* probable iron-uptake perfnea.se inner membrane proiein 56/34 
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Figure 2A 
(cntd) 

MrnGOO : unknowiri" i ?28 ~ ~ " ~ 

Nm601: probable /yv/?- family transcriptional regulator 47/37 

Nm602: carB, carbamoyl phosphate synthase 2/13, 94/39 

Nm613: probable marR-hmily regulator 92/41 

Nm618: hypothetical protein 5/14, 91/41, 92/45 

Nm619: possible GrxDpOiVsacdiaride rrocSfrabon acyitransfease 31/22, 46*36, 47/1 4, 96/1 1 

Nm629: unknown 9/6 

Nm630: unknown 10/4 

Nm631 : unknown 6/46, 9/23, 15/22, 34/9 

Nm636: avtA, probable valine-pyruvate aminotransferase 7/22 

Nm638: pglC, pilin glycosylation protein 68/25 

Nm639: pgIB, pilin glycosylation protein 90/45, 93/45 

Nm643: possible lipopoly saccharide biosynthesis translocase 43/13 

Nm646: unknown 7/16 

Nm650:/;//O, pilus secretin 10/33, 72/13 

Nm654: pi!M y probable pilus assembly protein 1 1/33, 19/33 

Nm670: possible two-component system sensor kinase 1/14 

Nm672: unknown 69/23 

Nm679: unknown 1/43, 15/17 

Nm687: probable periplasmic protein 96/46 

Nm688: JhaB-Yike gene 3/24, 14/31, 16/2, 48/7, 87/6 

Nrn690: unknown 10/21, 22/11 

Nrn69I : unknown, Asn-rich N-terminus 46/46 

Nm692: unknov/n 4/7, 20/27 

Nrn693: unknown within a region oflow CHC % 3/19 
Nm694: unknown within a region of low G+C % 7/17 
Nm696: unknown 1/40, 7/42 

Nm714: probable transmembrane hexose transporter 2/14, 5/47 
Nm7I5: probable transmembrane transport protein 1/2, 6/45, 15/40 
Nm720: unknov/n 4/19, 14/7 

Nm722: probable transmembrane transport protein 41/25 
Nm725: adhA, probable alcohol dehydrogenase 7/20, 51/22 
Nm726: probable pilin (ac/T J ) 1 1/24 

Nm729: probable ABC transporter ATP-binding protein 92/43 
Nm730: possible thiol-disulphide interchange protein 7/9, 10/18 
Nm740: unknown 6/10 

Nm74I : possible ubiquinone biosynthesis protein 92/15 

Nm747: nqrF, probable Na ^-translocating NADH-ubiquinone reductase subunit F 90/34 
Nm750: nqrC, probable Na' -translocating NADH-ubiquinone reductase subunit C 12/45 
N m 75 2 : nfA y probable Na f ^aJT&caljr*> NADl-kibk{iai->one reductase aixmit A 19/27, 9(v39 
Nm753: possible membrane protein 7/46\ 19/19, 28/21', 51/14, 95/45 
Nm755: unknown 70/H', 7 J/6', 73/5, 73/18 
Nm762: hypothetical protein 18/16, 90/29 
Nm786: possible membrane protein 78/23 
Nm796: unknov/n 92/2 

NmSOO: possible integral membrane protein 7/10, 29/28, 52/1, 69/14 
NmKOl- unknown 6/5*, 12/8' 
Nm802. unknown 30/25 

Nm804* possible sec-independent, protein translocase component 93/25 

Nn>816- probable polyamine permease ATP-binding protein 19/8 

Nm817- probable polyamine permease inner membrane protein 9/36 

NmBIO possible oxidnreductasc 96/31 

Nm820 amf/i, probable ammonium transporter 5/30 

Nm826: ppsA, probable phosphoenolpyruvate synthase 8/47 

Nm£27. unknown 7/11 

Nm828. probable phosphatase 36/3 

Nm832. possible glycosyl transferase 14/29,48/26 
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Nm845: unknown 6/8 , 

Nm847: argff, probable argininosuccinate lyase 95/5 Figure 2A 

NmS48: gaiU, probable UTP-glucose- 1 -phosphate uridylyltraasi erase 32/37 (cntd) 

Nm867: ampD, probable anhydro-N-acetylmuramyl-tri peptide amidase 48/6 

Nm868: possible periplasmic protein 46/19 

Nm873: possible periplasmic protein 1 1/5 

Nm879: trpA, probable tryptophan synthase a chain 1.9/24, 65/48 

Nm884: pyrC y probable dihydroorotase 1/37, 89/9 

Nm888: rnc, probable ribonuc lease IH 6/47 

Nm890: trpF, probable N-(5'-phosphoribosyl)anthranilate isomerase 95/39 
Nm892: purF, probable amidophosphoribosyltransferase 60/4 

Nm904: trpB, probable tryptophan synthase B chain 5/41, 8/40, 27/34', 85/25' 

Nm905: iga t IgAl protease 1/39; 7/1, 9/25, 12/34 

Nm909: probable transmembrane transport protein 1/28, 9/22 

Nm915: unknown 3/37 

Nm92S: tgi, probable queuine tRNA-ribosyltransferase 95/9 
Nm935: unknown 4/28, 75/44 
Nm941: unknown 2/45 

Nm942: fe< probable adsrx^sy inn 33/2 
Nm944: unknown 7/13 
Nm948: unknown 32/26 

Nm952: recN, probable DNA repair protein 68/1 1 
Nm955: unknown 5/7 

Nm962: probable D-alanyl-D-alanine-endopeptidase 1/48, 56/18, 96/16 

Nm96S: purC. probable phosphoribosylaminoimidazole-succinocarboxamide synthase 5/15 

Nm973: probable integral membrane protein 7/40, 44/34 

Nm977: lepA, probable GTP-binding protein 90/15 

Nm979: pilT2, possible pilus retraction protein 90/19 

Nm981: possible pilus biogenesis protein 11 /22 

Nm982: unknov/n 1/24, 7/33 

Nm987: probable secreted protein 64/5 

Nm990: unknown 80/2 

Nrn993: possible periplasmic protein 72/33 

Nm994: possible periplasmic protein 16/39 

Nm995: recB, probable exodeoxyribonuclease V 13 chain 8/42, 9/28, 41/35 
Nm999: probable amino acid permease integral membrane protein 73/39 



Nm 1 00 1 : pgtn, phosphoglucomutase 2/46, 6/35, 25/5, 94/2 
Nrnl()02: ppiB, probable peptidyl-prolyl cis-trans isomerase B 94/18 
Nml003: probable transmembrane transport protein 29/42 
Nml008:y?j7, probable cell division protein 15/J 
Nml012: probable carbon-sulphur lyase 6/28, 92/4 

NrnlOlS: probable NAD(P)I l-flavin oxidoreductase 25/14, 29/32Nml016: probable RNA-bmding protein 93/44 
Nml020: probable ftf/rt-family transcriptional regulator 28/4iNml022: possible transmembrane efflux protein 
8/22, 37/27Nml034: r/aE, probable '\W-heplose synthetase !3/6Nm!035: nm%ll, probable modification 
mcthylase 60/25, 93/1 INm 1037: rfliD, probable ADFM.-ulycero-mannohepUise epimerase 7/30, 47/39Nml04S: 
unknown I8/3735/37Nml052: real probable single-stranded-DNA-specillc cxonuclcase 43/9Nml0.v4: possible 
periplasmic protein 6/2Nml056: unknown 5/ INm 1062: r<!«C. unknown 2/39Nrn 1 073: unknown 11/13, 
83/7Nml074: possible membrane protein 10/16, 47/27Nm 1084: possible periplasmic protein 15/26 
NrnlOM- unknown 7/35Nml087: possible transferase 4/29, 84/1 7Nm 1090: possible periplasmic protein 7/27, 

l9 /, 2 __ ___ . Nm ! 1 07: 

possible" membrane protein 96/8Ninllb8: possible membrane protein 34/7Mmlll0: probable pilin 12/39, 
4l/23Nml III: unknown !8/19Nml ! 18: ^ CM^iudyta W\ficy40,73M* 
<WSNmll26: probable transmembrane transport protein 46/38NmM36: mctX, probable homoserme ()- 
acclyltransferasc 3/11, M/42Nml 1 38: rpmi\2 t possible additional 50S ribosomul protein 1,3 I 96/2 
Nml 139: rnvtF, probable 5,10-melhyleneietrahydrofolaie reductase 92/29 
Nm 1 1 40: /;:c*/%[:itW.>k:^ 
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Nm843: JbpB, probable iron-uptake permease inner membrane protein 56/34 
Nm845: unknown 6/8 

Nm847: argff, probable argininosuccinate lyase 95/5 

Nm867: ampl\ probable anbydro-N-acetyhriuramyl-tripeptide amidase~4S/6 
Nm868: possible penplasmic protein 46/19 
Nm873: possible penplasmic protein 1 1/5 

Nrn879: irpA, probable tryptophan synthase a chain 19/24, 65/48 
NmS84: pyrC, probable dihydroorotase 1/37, 89/9 
Nm888: rnc, probable ribonuclease Til 6/47 

NmS90: trpF\ probable N-(5'-phosphoribosyl)anthraniiatc isomerase 95/39 
Nm892:/w/7% probable amidophosphoribosyltransferase 60/4 

Nm904: (rpB s probable tryptophan synthase 6 chain 5/41, 8/40 27/34' 85/25' 

Nm905: iga, IgAI protease t/39, 7/1, 9/25, 12/34 

Nm909: probable transmembrane transport protein 1/28, 9/22 

Nm91 5: unknown 3/37 

Nm928: tgt, probable queuine t RNA- ribosy It ran sferase 95/9 
Nm93 5: unknown 4/28, 75/44 
Nm94 1 : unknown 2/45 

Nm942: ^Hpn^adenosyim^^ 33/ 
Nm944: unknown 7/13 
Nm948: unknown 32/26 

Nm952: reciV, probable DNA repair protein 68/1 1 
Nm955: unknown 5/7 

Nm962: probable D-alanyl-D-alanine-endopeptidase 1/48, 56/18 96/16 

225* ^u' K^° babIe ? h OS P h u oribos y Iaminoi ^ d ^le-succinocarboxamide synthase 5/15 
Nm97j : probable integral membrane protein 7/40, 44/34 

Nm977: lepA, probable GTP-binding protein 90/15 

Nm979: pil72, possible pilus retraction protein 90/19 

Nm981: possible pilus biogenesis protein 11/22 

Nm9S2: unknown 1/24, 7/33 

Nm987: probable secreted protein 64/5 

Nm990: unknown 80/2 

Nm993: possible periplasmic protein 72/33 

Nm994: possible penplasmic protein 16/39 

Nm995: rccB, probable exodeoxyribonuciease V* 8 chain 8/42, 9PH 41/35 
Nm999: probable amino acid permease integral membrane protein 73/39 

NrnlOQ] , fKn7 j nhosphoglucomut asc 2/46. 6/35. 25/5. 94/2 

Nml002: ppi/J, probable peplidyl-prolyl cis-trans isomerase B 94/18 

Nm!003: probable transmembrane transport protein 29/42 

Nm 1 008://*/, probable ceil division protein 15/1 

Nml()12: probable carbon-sulphur lyase 6/28, 92/4 

Nmi015: probable NAi)(P)H-flavin oxidoreductase 25/14, 29/32 

Nm 1 016: probable RNA-binding protein 93/44 

NmJ020: probable /c//V-family transcriptional regulator 28/41 

Nml022: possible transmembrane cfllux protein 8/22, 37/27 

NlTLlp.4 : rfaK, probable | 3/6 

NmI035; ///«#//, probable modification mclhylasc 60/25, 93/1 I 

? IL H^ ; ^r l p,0!> - ! - ! " ADP-K-Iycero : mnnnc,^ 7 /30. 47/39 

Nml056. unknown 5/1 Nm 1062: n/gC unknown ?/39 

NmlOTJ: unknown 1 1/(3. nSHNmitm- possible membrane protein 10/16, 47/27 
(vmiOSr possible penplasmic protein 15/26 

Nm 1 086; unknown 7/35Nm 1 087: possible iraiisferasc 4/29. 84/1 7 
Nin ! 090; possible periplasmic prot.cin7/27, 19/12 

Nnil 107- possible membrane pmtein 96/8Nm 1 1 08: possible membrane prolan 34/7 
Nm! 1 10: probable pilm 12/39, 41/23 



Figure 2A 
(cntd) 
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Nmi !43: sdhC, probaBle succinate dehydrogenase cytochrome B subunit 92/23Nml 145: sdhA, probable 
succinate dehydrogenase flavoprotein subunit 4/39Nmll46: sdhB, probable succinate dehydropgiyEP Qm- 
sulphur protein 18/44 (cntd) 
Nmll48: git A, probable citrate synthase 16/6\ 20/34', 81/36NmIl49: sucA, probable 2-oxogJutarate 
dehydrogenase 13! component 15/24Nml 1.52: possible membrane protein 13/7, 93/18, 95/6Nmll62: unknown 
96/1 * 

Nml 163: frpG, probable anthraniiate synthase component 11 26/22', 48/1'NmI 164: trpD t probable anthranilate 
phosphoribosyltransfera.se 9/l9Nml 174: unknown 1 !/45Nml 184: possible lipoprotein 1 l/25NmI 186: unknown 
12/1 8Nm 1195: ' unknown 

6/33 , Nml 200: 

probable surface fibril protein 47/19', 52/12 r Nm 1206: probable oxidoreductase 6/48, 34/8, 77/1 7Nm 1207: 
unknown 14/24Nml228: unknown within a region of unusually low G+C % 23/29Nrn 1 243: probable sulphate- 
binding protein 7/26Nml248: possible periplasmic protein 5/37Nml249: ubcZ, probable ABC transporter ATP- 
binding protein 2/8, 78/6Nrnl250: dedA, hypothetical protein 19/5NmI254: jztyA, probable serine 
hydroxy methyltransferase 42>20Nml255: ggt, probable gamma-glutamy (transpeptidase 14/8, 92/27,* 
94/l5"Nm 1263: unknown 89/L6Nml272: unknown 95/2 

Nml276: possible membrane protein 5/43, 16/13, 20/40Nml285: unknown 94/37 

Nml 286: possible phage transposase 

82/19 : Nml 307: 

unknown 95/32 

Nm i 3 1 3 : unknown 1 8/1 3, 22/1 8, 94/35Nm 1315: unknown 1 8/3 1 , 93/1 0 

Nml 325: possible phage tail fibre protein 15/19Nml33 1: ABC transporter ATP-binding protein 3/43, 22/25 
Nm 1 334: possible periplasmic protein 1/32, 43/1 7 
Nml 34 1: unknown 12/46 
Nml 348: unknown 76/13 

Nml 351: possible RNA methyltransferase 76/37 

Nm 1 356: mpi, possibb UDP-N-aoetytmL2^^ 11/38, 12/15 

Nm 1 362: cys/ 7 probable "sulphite reductase 6 subunit 25/23 

Nml 363: cy.v.7, probable sulphite reductase a subunit 4/38, 11/12 

Nml 365: cysD, probable sulphate adenylate transferase subunit 2 19/29 

Nml 366: cysH, probable phosphoadenosinephosphosulfate reductase 8/32 

Nml 367: cysG, sirohaem synthase 53/16 

Nml 369: unknown, similar to the C-terrninal half of bacterial hypothetical-proteins 8/34, 63/5 
Nml 371: vacB, probable ribonuclease 40/33 

Nml 372: gitaB, probable inosineo*-monophosphate dehydrogenase 65/17 

Nm 1 373: unknown 1 1/32, 27/42 

Nml 375: possible transcriptional regulator 95/8 

Nml 38 1 : probable integral membrane protein 74/27 

Nml 385: probable type I restriction-modification system modification protein 93/14 
Nml 393: possible ABC transporter ATP-binding protein 1 1/31 
Nml 395: hom, probable homoserine dehydrogenase 16/37 

NmMOl: recD, exodeoxyribonuclease V a subunit 13/5', 14/39', 31/25 
Nm!408: unknown 12/31 

Nmt409: probable ABC-transporter ATP-bindini; protein 10/42, 18/21 
Nml4Lv. unknown 7/47 

Nm 14 1 7: possible molylxioptcrin-guanine dinucieotide biosynthesis protein A 10/20, 34/28 

Nml4!S: probable two-component system sensor kinase 87/16 

Nml 421: purM y probable phosphoribosylformyl^lycinamidine cyclo-ligase 7/5 

NmI-125: rihA, probable OTP cyclohydrolase II 9I/3 

Nm 1 427: probable glycosyl transferase 42/9 

Nm 1 4.35: unknown within a region of low G I C % 1 9/6, 90/47 

Nml43S: possible integral membrane protein 15/6 

Nm 1*140: ihrC\ probable threonine synthase 87/24 

Nml '141: unknown 2 1/31 

Nm(445: unknown 9/15 

Nm 1448: raJ(.\ probable DMA Ycp:\'\v protein 3/36, 4/45 
Nml 450: lcuC\ probable )4sopropy!malale dehydratase; large subunit 68/2 
Nm 1456: ZcW*, probable 3-isopropylmahtfo dehydrogenase 4/9, 40/35 
Nml 459: aspA y probable aspartate ammonia lyase 19/34, 44/45, 52/9 
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Figure 2A 
(cntd) 

Nml 1 11: unknown 18/19 

Nml 118: L% CMP-r>kxetyL^ 9/46, 65*10, 73/46, 94/8 

Nml 126: probable transmembrane transport protein 46/38 

Nml 136: meiX y probable homoserine O-aceryltransferase 3/11, 14/42 

Nml 138: rpmE2, possible additional SOS ribosoma! protein 131 96/2 

Nml 139: metf \ probable 5, 1 0-methylenetetrahydrofolate reductase 92/29 

Nml J 40: /M^protebfeSnixri^ 

Nml 143: sdhC, probable succinate dehydrogenase cytochrome B subunit 92/23 

Nml 145: sdhA, probable succinate dehydrogenase flavoprotein subunit 4/39 

Nml 146: sdhB, probable succinate dehydrogenase iron-sulphur protein 18/44 

Nml 148: gliA, probable citrate synthase 1.6/6', 20/34\ 81/36 

Nml 149: sucA y probable 2-oxoglutarate dehydrogenase El component 15/24 

Nml 152: possible membrane prqtein 13/7, 93/18, 95/6Nml 162: unknown 96/1 

Nml 163: trpG, probable anthranilate synthase component TI 26/22', 48/1' 

Nml 164: irpD, probable anthranilate phosphoribosyltransferase 9/19 

Nml 174: unknown 11/45 

Nml 184: possible lipoprotein 11/25 

Nml 186: unknown 12/18 

Nml 195:unknown6/33 



Nml 200: probable surface fibril protein 47/19', 52/12' 
Nml206: probable oxidoreductase 6/48, 34/8, 77/17 
Nml 207: unknown 14/24 

Nml 228: unknown within a region of unusually low G+C % 23/29 
Nml 243: probable sulphate-binding protein 7/26 
Nml 248: possible periplasmic protein 5/37 

Nml 249: abcZ, probable ABC transporter ATP-binding protein 2/8, 78/6 
Nml 250: dedA, hypothetical protein 19/5 

Nml 254: glyA, probable serine hydroxym ethyl transferase 42/20 

Nml 255: ggl, probable gamma-glutamyltranspeptidase 14/8, 92/27, 94/15 

Nml 263: unknown 89/16 

Nml 272: unknown 95/2 

Nml 276: possible membrane protein 5/43, 16/13, 20/4 

Nml 285: unknown 94/37 

Nml 286: possible phage transposase 82/19 



Nml 307: unknown 95/32 

Nml 3 13: unknown 18/13, 22/18, 94/35 

Nml 3 15: unknown 18/31, 93/10 

Nml 325: possible phage tail fibre protein 15/19 

Mm 1331: ABC transporter ATP-binding protein 3/43, 22/25 

Nml334: possible periplasmic protein 1/32, 43/17 

Nml 341: unknown 12/46 

Nml 348: unknown 76/13 

Nm 1351: possible RNA methyltransicrase 76/37 

Nml356: //^rxx^UDP-N-ac^ 11/38,12/15 

Nml 362: cyst, probable sulphite reductase B subunit 25/23 

Nml 363: cy&J, probable sulphite reductase a subunit 4/38, 1 1/1.2 

Nml 365: cysl\ probable sulphate adenylate transferase subunit 2 19/29 

Nml 360. cysH, probable phosphoadonosinephosphosulfate reductase 8/32 

Nm 1 3 6 7 cys( /, si ro h acm sy n t ha se 53/ 1 6 

Nml 369. unknown, similar to the C-terminal half of bacteria) hypothetical proteins 8/34, 63/5 
Nml 371: vacft, probable ribonuclease 40/33 

Nml 372: guatt, probable laosine^ 1 -monophosphate dehydrogenase 65/17 

Nm 1.373 unknown 1 1/32, 27/42 

Nml 375: possible transcriptional regulator 95AS 

Nml 381: probable integral membrane protein 74/27 

Nml 385: probable type 1 restriction-modification system modification protein 93/J4 
Nml 393: possible ABC transporter ATP-binding protein 1 1/31 
Nmi 195: hnm y probable homoserine dehydrogenase 16/37 
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Nml460: probable integral membrane protein 1 1/27 
Nmi46l: probable e//ia/- family protein 5/35 
Nm 1465: unknown 95/25 

Nml466: probable type III restriction-modification system endonuclease protein 8/43 
Nm!467: type III restriction-modification system 4/25 
Nml473: unknown 15/35 

Nmi475: unknown 7/3!, 8/8', 8/18% 92/5", 92/12", 92/44 
Nml482: probable acyl-CoA ligase 5/39 

NmJ483: probable transmembrane transport protein 10/34, 14/1 1 
NmI485: gar, site-specific recombinase J3/39, 15/48 
Nml491: rnfcl, transcription-repair coupling factor 14/3 



NmlSOO: probable modification methylase 19/37', 36/37 1 

Nml507: possible membrane-bound lytic murein transglycosylase 5/28, 86/5, 90/27 
Nml508: possible ribosomal small subunit pseudouridine synthase 19/21, 56/44 
Nm 1509: possible sodium-dependent transporter 25/25 
N m 1 5 1 8 : cx#C\ aloof oi dehydfegenase class-Ill, gliitatho^lerxrdent fcimakJefyde dehydro^nase 1 8/25, 34/26, 94'48 
Nm 1 5 1 9: esci^ esterase D 1 3/44 
Nml523: possible lipoprotein 19/4 
Nml528: itraA, uracil permease 10/22, 40/6 
Nml533: conserved hypothetical integral membrane protein 7/14 
Nml540: uvrC, excinuclease ABC subunit C 63/12 
Nm 1541: unknown 3/5 

Nml542: conserved hypothetical protein 96/42 
Nml545: uvrB, excinuclease ABC subunit B 43/4 
Nml547: possible secreted lysine-rich protein 8/3 
Nml557: unknown 11/11 

Nml558: possible TonB-dependent receptor protein 1/31 

Nmi 565: probable aldehyde dehydrogenase 2/36 

Nm 1571: iron/sulphur-binding ox id o reductase 90/18 

Nm!574: possible integral membrane transporter 1 1/35, 46/17, 74/48 

Nmi 575: xseA, exonuclease VII large subunit 6/23 

Nml577: unknown 12/6 

Nmi 579: unknown 19/48, 66/14 

Nmi 580: probable ATP-dependent RNA helicase 53/9 
Nmi 584: argD % acetylornithine aminotransferase 9/48 
Nrnl589: conserved hypothetical protein 45/18 

Nmi 590: probable type III restriction/modification system modification methylase 15/5 
Nml592: UdA y {.-lactate dehydrogenase 8/4', 20/l r , 77/12 

Nmi 603: unknown 96/13 ~ 
NmI617: socfC, superoxide dismutase 1/47 
Nml639: probable integral membrane protein 6 J/48 
Nml642: porA, porin, class I outer membrane protein 6/4 
Nmi 64 5: possible lipoprotein 92/13 

Nmi 646: pho-spbolipase D- family protein X/36, 14/4, 40/22, 95/48 

Nm!647: probable integral membrane protein 4/14, 10/3, 27/35 

Nmi 660: rep, ATP-dependent DNA helicase 22/2 

Nmi 665: possible DNA polymerase 111 subunit 3/48, 9/27 

Nm\679: ppXfyppA, possible e.xophosphatase 4/44 

Nml687: probable glutamale ciehytlrugeiia.se 15/36 

Nm 1689: possible regulatory protein 9/9 

Nm 1690: unknown 1 1/4 

Nmi 702: rhh, ribonucleic 6/j4 V ,~S/39 r " 

Nmi 709: binding-protcin-clcpondent transport systems inner membrane protein 3/28, 8/9 

N'm!7l5: possible integral membrane protein I 1/20 

Nmi 720: unknown 8/26 

Nmi 722: possible cis-trans isomcraso 3/47 

NmI72j: possible proline-rtch repeat protein 96/24 



Figure 2A 
(cntd) 
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Figure 2A 
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NmMOl: recD, exodcoxyribonuclease V a subunit i3/5\i4/39\ 31/25 
N ml 408: unknown 12/31 

Nml409: probable ABC-transporter ATP-binding protein 10/42, i 8/21 
Nml415: unknown 7/47 

Nrn)417: possible molybdopterin-guanine dinucleotidc biosynthesis protein A 10/20, 34/28 

Nml4 18. probable two-component system sensor kinase 87/1 6 

Nm 142 1 : purM > probable phosphoribosylformylglycinamidine cyclo-ligase 7/5 

Nml425: ribA > probable GTP cyclohydrolase 11 91/3 

Nml 427: probable glycosyl transferase 42/9 

Nml435: unknown within a region of low G*C % 19/6, 90/47 * 

Nml438: possible integral membrane protein 15/6 

N r ml440: thrC, probable threonine synthase 87/24 

Nrnl441: unknown 21/31 

Nml445: unknown 9/15 

Nml 448: radC, probable DNA repair protein 3/36, 4/45 

Nml 450: ieuC, probable 3-isopropylmalate dehydratase large subunit 68/2 

Nml456: leuB, probable 3-isopropylmalate dehydrogenase 4/9, 40/35 

Nml459: aspA, probable aspartate ammonia-lyase 19/34, 44/45, 52/9 

Nml460: probable integral membrane protein 1 1/27 

Nml461 ; probable <^«<z/-family protein 5/35 

Nml465: unknown 95/25 

"Nml 466: probable type III restriction-modification system endonuclease protein 8/43 
Nml467: type III restriction-modification system 4/25 
Nml 473: unknown 15/35 

Nml475: unknown 7/31, 8/8% 8/18% 92/5", 92/12", 92/44 
Nml 482: probable acyl-CoA ligase 5/39 

Nml483: probable transmembrane transport protein 10/34, 14/11 
NmJ485: gcr 7 site-specific recombinase 13/39, 15/48 
Nml491: mfd, transcription-repair coupling factor 14/3 

Nm!500: probable modification methylase 19/37', 36/37' 

Nml 507; possible membrane-bound lytic murein transglycosylase 5/28, 86/5, 90/27 
Nml508: possible ribosomal small subunit pseudouridine synthase 19/21, 56/44 
Nml 509: possible sodium-dependent transporter 25/25 

Nm 1 5 1 8 : aJC, aicohd dehydrogenase dass-HI, ^toieKkpendert formaldehyde deiiydrpgenase 1 8^ 34/26, 94/48 

Nml 5 19: esd, esterase D 13/44 

Nml 523: possible lipoprotein 19/4 

Nml 528. uraA, uracil permease 10/22, 40/6 

Nml 533: conserved hypothetical integral membrane protein 7/14 

Nml 540: i/vrC, excinuclcase ABC subunit C 63/12 

Nm 1 54 1 • unknown 3/5 

Nml 542: conserved hypothetical protein 96/42 
Nml 545: uvrB, excinuclcase ABC subunit 13 43/4 
Nml 547: possible secreted lysme-rich protein 8/3 
Nml 557: unknown 1 1/1 1 

Nm 1 558: possible TonB-dependcnl receptor protein J/31 

Nml 565: probable aldehyde dehydrogenase 2/36 

Nm ! 57 1 : iron/sulphur-binding oxidorcduetoso 90/1 8 

Nml 574: possible integral membrane transporter 1 1/35, 46/17, 74/48 

Nm 1 575: mv1, cxonuolease VII large subunit 6/23 

Nml 577: unknown 1 2/6 

Nml 57 < >: unknown 19/4S, 66/14 

Nml 580: probable ATP -dependent UNA holicase 53/9 

Nml 584: urgD, acctylomithine amino! ransl erase 9/48 

Nml 58 { ): conserved hypothetical protein 45/18 

Nml 590' probable type 111 restriction/modification system modification mclhyhisc 15/5 
Nml 592: UdA, I .-lactate dehydrogenase 8/4', 20/ 1', 77/12 

Nml 603: unknown 96/13 
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Nml727: rfaF, ADP-heptose:LPS heptosy [transferase i! 70/7 Figure 2A 

Nm 1 73 I : conserved hypothetical protein 94/27 & 
Nml740: IbpB, lactoferrin-binding protein 37/17 (cntdj 
Nm 175 I : possible gw/fl-Family transcriptional regulator 90/30 
Nm 1754: unknown 14/35 

Nml755: pitrN, phosphoribosylglycinamide transformylase 2/2 
Nml756: possible peptidyl-prolyl isomerase 1/42, 15/10 
Nml765: ilvH, acetolactate synthase isozyme (II small subunit 52/29 
Nml767: possible lipoprotein 96/34 

Nm!771: hisQ histidinol-phosphate aminotransferase 19/39 

Nm 1 775: partial ORF 93/46 

Nml 778: possible secreted protease 35/1 

Nml783: AraC-family transcriptional regulator 51/3 

Nml785: conserved hypothetical integral membrane protein 96/15 

Nml786: possible polyamine permease substrate-binding protein 29/6, 96/5 

Nm 1 8 XlTfitniA, fumarate hydratase 2/15, 4/41 

Nmi8l6: conserved, possible DNA-binding, hypothetical protein 93/12 
Nml819: conserved hypothetical protein 12/35, 92/21 
Nm 1 820: gpxA, glutathione peroxidase 1/13 

Nml886: nor, nitric oxide reductase 4/30, 8/38, 10/7, 12/43, 13/20, 67/32, 80/16 

Nml 887: aniA y nitrite reductase 17/6 

Nm 1 892: conserved hypothetical protein 2/27 

Nml 894: serC, phosphoserine aminotransferase 36/4 



Nml 900: possible haemolystn 8/12 

Nml 901 : possible sodium-alanine symporter 2/9, 22/5 

Nml 920: apt. adenine phosphoribosyltransferase 9/35, 13/35, 34/1, 49/6 

Nm 1 92 1 : conserved hypothetical protein 58/4, 86/44 

Nml 925: hmbR y haemoglobin receptor 1 1/19, 25/40, 62/9 

Nml 928: hypothetical integral membrane protein 93/33 

Nml 929: hypothetical integral membrane protein 9/40 

Nml 934: gcvP, glycine dehydrogenase 4/23, 18/39, 21/35, 32/27, 47/4 

Nml 935: unknown 59/44, 95/10 

Nml 936: probable cytochrome 71/35 

Nml 937: tyrB, aromatic amino acid aminotransferase 2/30, 20/19, 47/36, d7/6, 63/>.j 

Nml938: IrmA, tRNA (uracil-5->methyltransferase 16/21 

Nml 939: aroC, chorismate synthase 88/14 

Nml 940: unknown 23/31, 94/19 

Mm 1944: idhA. D-lactate dehydrogenase 60/10 

Nml 946: conserved hypothetical protein 18/28 

Nm 1951: unknown 4/6, 37/39 

Nml 953: unknown 80/23 

Nml 957' f'ibl 7 ? 3-oxoacyl-(acyl-canrier-protein) synthase 11 11/44, 17/8 
Nrnl958: fe/F, f.i-1.4-glucosy!transferase 3/15*, 23/19', 57/1 V\ 7 1/8", 7 I/I0"- 
Nml 960: unknown 2/li 

Nml 96 I: probable integral membrane ion transporter 10/12 
Nml969: mtrD, probable drug efflux protein 36/36 
Nml 970: mlrC, membrane fusion protein 76/30, 92/42 
Nml973: hypothetical integral membrane prut-.;in 2/40, 12/2 
Nm 1 975: possible integral membrane protein 27/20, 94/9 
Nm!98 3: exhi), biopolymer transport protein S5/3 
Nm 1984: exb(3> biopolymer transport protein 37/33 

Nml996: naiO\ possible periplasm* type I secretion .system protein, N-terminal region 6/42 



Nm2Q07: unknown 12/24 

Nm2009: unknown 3/10 

Nm20t0: hypothetical protein 5/10,49/17 

NnViOl 1: conserved hypothetical protein V UMk 95/36 

Nm20!2: conserved hypothetical protein 43/12 
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Figure 2A 

Nm 1617: sot/C, superoxide dismutase i/47 (cntd) 
Nml639: probable integral membrane protein 61/48 
Nml 642: porA, porin, class I outer membrane protein 6/4 
Nml645: possible lipoprotein 92/13 

Nml646: phospholipase D-f ami Iy protein 8/36, 14/4, 40/22, 95/48 
Nml 647: probable integral membrane protein 4/14, 10/3, 27/35 
Nml660: rep, ATP-dependent DNA helicase 22/2 
Nml665: possible DNA polymerase III subunit 3/48, 9/27 
Nml 679: ppX/gppA, possible exo phosphatase 4/44 
Nml6S7: probable glutamate dehydrogenase 15/36 
Nml689: possible regulatory protein 9/9 
Nml690: unknown 11/4 

Nml702: rbh y ribonuclease 6/34 ; , 8/39' 

Nml 709: binding-protein-dependent transport systems inner membrane protein 3/28, 8/9 

Nml715: possible integral membrane protein 11/20 

Nml720: unknown 8/26 

Nml722: possible cis-trans isomerase 3/47 

Nml723: possible proline-rich repeat protein 96/24 

Nml727: rfaF y ADP-heptose:LPS heptosyltransfcrase n 70/7 

Nml731: conserved hypothetical protein 94/27 

Nml740: fhpB, lactoferrin-binding protein 37/17 

Nml75I: possible g?7/7?-family transcriptional regulator 90/30 

Nm 1754: unknown 14/35 

Nml755: purN, phosphoribosylglycinamide transformylase 2/2 
Nml756: possible peptidyl-prolyl isomerase 1/42, 15/10 
Nmi765: ilvH, acetolactate synthase isozyme III small subunit 52/29 
NmI767: possible lipoprotein 96/34 

Nml771: hisC, histidinol-phosphate aminotransferase 19/39 

Nml775: partiaJ ORF 93/46 

Nml778: possible secreted protease 35/1 

Nm 1 783: AraC-family transcriptional regulator 51/3 

Nml785: conserved hypothetical integral membrane protein 96/15 

NmI736: possible polyamine permease substrate-binding protein 29/6, 96/5 

Nml8I2: fumA, fumarate hydratase 2/15, 4/41 

Nml816: conserved, possible DNA-binding, hypothetical protein 93/12 
Nmi8I9: conserved hypothetical protein 12/35, 92/2 i 
Nml820: gpxA, glutathione peroxidase 1/13 

NmlS86: not\ nitric oxide reductase 4/30, 8/38, 10/7, 12/43, 13/20, 67/32, 80/16 

NmJS87: amA, nitrite reductase 17/6 

Nm!S92: conserved hypothelical protein 2/27 

Nml 894: serC, phosphoscrinc aminotransferase 36/4 

Nml900: possible haemolysin 8/12 

Nml 901 : possible sodium-alanine symporler 2/9, 22/5 

Nml 920: apt, adenine phosphoribosyltransFcrasc 9/35, 13/35, 34/1, 49/6 

Nml 921: conserved hypothelical protein 58/4, 86/44 

Nml 925: hmhH, haemoglol)in receptor 1 1/19, 25/40, 62/9 

Nml 928: hypothelical integral membrane protein 93/33 

Nml 929. hypothelical integral membrane protein 9/40 

Nm 193*1: gcvl\ glycine dehydrogenase 4/23, 18/39, 21/35. 32/27, 47/4 

Nm 1 93 5: unknown 59/44, 95/10 

NmI930" probable cytochrome 71/35 

Nrn 19)7 iyr/>: aromatic amino acid aminotransferase 2/30, 20/19, 47/36, 57/6, 63/23 

Nin!93S: frmA, tRNA (nrncil-5-)-n^tliylt ransfcra.se 16/21 

Nml 930: am<\ chorisniato synthase 88/14 

Nml 940: unknown 23/31, 94/19 

Nml 94-1- Uh/U 1>-Iaclale dehydrogenase 60/10 

Nml 946: conserved hypothetical protein 18/28 
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Nm2016: speB, agmatinase 1/46', 42/46* 

Nm2020: possible integral membrane protein 92/1 S . Figure 2A 

Nm202 1 : possible phospho lipase 3/38 (cntd) 
Nm2028: amiC\ N-acetylmuramoyl-L-alanine amidasc 78/47 
Nm2029: unknown 7/21 

Nm2036: unknown within a region of low G+C % 56/23 

Nm2038: recO t DMA repair protein 16/44 

Nm2040: possible integral membrane efflux protein 95/33 

Nm2046: hypothetical integral membrane protein 1 1/18 

Nm2048: conserved hypothetical protein 41/12, 81/22 

Nm2052: acnA, aconitate hydratase 38/25 

Nm2054: prpQ citrate synthase 2 8/28, 91/29, 96/30 

Nm2069: probable integral membrane protein 3/4, 31/5 

Nm2076: conserved hypothetical protein 13/2 

Nrn2078: possible thiol-disulphide interchange protein 2/31, 24/13 

Nm2083: putP, sodium/pro line symporter 30/33 

Nm2084: putA % bifunctional PutA protein 13/42, 16/10 

Nm2086: xthA y exodeoxyribonuclease III 86/33 

Nm2092: nadB t L-aspartate oxidase 2/4, 47/24 

Nm2098: map A, possible maltose phosphorylase 5/34 

Nm2099: galM, aldose 1-epimerase (mutarotase) U/10 



Nm2100: possible maltose/H* symporter 9/38, 70/40 

Nm210l: ABC transporter ATP-binding protein 1/19', 5/18', 14/4, 38/23, 90/44 
Nm2i 13: mq/B2, adhesin 2/7, 4/42, 10/32, 38/24 

Nm21 18: hypothetical protein within a region of low G+C % 12/32, 24/1 1, 68/23 
Nm2120: hypothetical protein within a region of low G+C % 3/23, 78/24 
Nm2122: hypothetical protein within a region of low G+C % 20/13, 29/3 
Nm2l23:/r/?C', possible partial CDS 16/30 
Nm2l24: possible outer membrane protein 43/5 
Nm2l29: aroE, shikimate dehydrogenase 49/7 

Nm2l30: mtgA, monofunctional biosynthetic peptidoglycan transglycosylase 52/6 
Nm2l31: possible ABC transporter ATP-binding protein 13/10 
Nm2l36: tai, transaldolase 92/9 

Nm2137: hypothetical protein within a region of low G+C % 1 1/14*, 11/42', 96/47 
Nm2140: conserved hypothetical protein 1/1 1,4/13, 66/4 
Nm2142: possible periplasmic hypothetical protein 46/40 
Nm2145: possible morphogene 19/45,23/18, 32/22, 75/47 
Nm2146: tspA, Neisseria-$pec\fic antigen protein 9/9 

Nrn2151: ilvE, probable branched-chain amino acid aminotransferase 41/31, 66/34 

Nm21 55: pilG, pilus-assembly protein 2/19, 14/45, 25/19, 67/33, 85/28 

Nm2 1 56: pilD, prepilin leader peptidase 4/16 

Nm215S: conserved hypothetical protein 1/10 

Nm2 1 59: pilF, type IV piius assembly protein 73/27 

Nm2l67: hypothetical protein 1/1.2 

Nm2 168: emrB, multidrug resistance tninslocaso 33/39, 57/48 
Nm2171: conserved hypothetical inner membrane protein 4/35, 6/20 
Nm2172: conserved hypothetical protein, possible peptidase 6/38 
Nrn2l74: conserved hypothetical protein 6/41 
Nm2l75: vapA, possible virulence associated protein 29/25 
Nm2l7S: conserved hypothetical protein 2/48 

Nm2l80: aroC\ phospho-2-dehydro-3-deoxyheptonate aldolase 78/10 

Nm2 1 83: opcB, outer membrane protein 32/30 

Nm2i86: hypothetical protein 19/22 

Nm2 1 89: -hypothetical inner membrane protein 2AV, 2/10' 

Nm2l93: possible TonB-dopendent receptor protein 71/23 

Nm2200: dinCJ, probable ATP-dopendem DMA helicase 83/44 

Nm2204: hypothetical protein 1/23, 3/12 

Nm22l2: trpC % indole -3 -glycerol phosphate synthase 83/13 
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Figure 2A 
(cntd) 

Nm 1 95 1 : unknown 4/6, 37/39 
Nml953: unknown 80/23 

Nml957: fabF2, 3-oxoacYl-(acyl-carrier-protein) synthase II 1 1/44, 17/8 
Nml958:'/e// ; : B -1.4-^n^ Y ltransferasc 3/15\ 23/19\ 57/1 1 71/8". JAIML 
NmI960: unknown 2/11 

Nml961 : probable integral membrane ion transporter 10/12 
Nrnl969: mtrD, probable drug efflux protein 36/36 
Nml970: mtrC\ membrane fusion protein 76/30, 92/42 
Nml973: hypothetical integral membrane protein 2/40, 12/2 
Nml975: possible integral membrane protein 27/20, 94/9 
Nml983: exbD, biopolymer transport protein 85/3 
Nm 1 984: exbB } biopolymer transport protein 37/33 

Nml996: nalD\ possible pcriplasmic type I secretion system protein, N-terminal region 6/42 

Nm2007: unknown 12/24 

Nm2009: unknown 3/10 

Nm2010: hypothetical protein 5/10, 49/17 

Nm201 1 : conserved hypothetical protein 94/36, 95/36 

Nm2012: conserved hypothetical protein 43/12 

Nm20 L6: speB, agmatinase l/46\ 42/46* 

Nm2020: possible integral membrane protein 92/18 

Nm202 1 : possible phospholipase 3/38 

Nm2028: amiC, N-acetylmurarnoyl-L-alanine amidase 78/47 

Nm2029: unknown 7/21 

Nm2036: unknown within a region of low G+C % 56/23 

Nm2038: recO, DNA repair protein 16/44 

Nm2040: possible integral membrane efflux protein 95/33 

Nm2046: hypothetical integral membrane protein 1 1/18 

Nm204S: conserved hypothetical protein 41/12, 81/22 

Nm2052: acnA y aconitate hydratase 38/25 

Nm2054: prpC 9 citrate synthase 2 8/28, 91/29, 96/30 

Nm2069: probable integral membrane protein 3/4, 31/5 

Nm2076: conserved hypothetical protein 13/2 

Nm2078: possible thiol-disulphide interchange protein 2/31, 24/13 

Nm2083: pulP, sodium/proline symporter 30/33 

N"m20S4: put A, Afunctional Put A protein 13/42, 16/10 

Nm2086: xthA, exodeoxyribonuclease III 86/31 

Nm2092: nadB, L-aspartate oxidase 2/4, 47/24 

Nm2098: mapA, possible maltose phosphorylase 5/34 

Nm2099: galM, aldose 1-epimerase (mutarotase) 11/10 

Nm2100: possible maitose/I-T symporter 9/38, 70/40 

Nm2101 : ABC transporter ATP-binding protein 1/19\ 5/1 8\ 14/4, 38/23, 90/44 
Nm2 113: mufB2 % adhesin 2/7, 4/42, 10/32, 38/24 

Nm2 J 1 S: hypothetical protein within a region of low G * C % 12/32, 24/! 1 , 68/23 
Nm2l20: h.ypolhetical protein within a region of low Gh-C % 3/23, 78/24 
Nm2122: hypothetical protein within a region of low Cr'rC % 20/13, 29/3 
Nm2123://yX T ' s possible partial CDS 16/30 
Nm2i24: possible outer membrane protein 43/5 
Nm2 129. aro/% shikimate dehydrogenase 49/7 

Nm2130. migA, mono functional biosynthetic peptidoglycan Iransglycosylase 52/6 
Nm213 I : possible ABC transporter A'l P-binding protein 13/10 
Nm2136. iai, transaldoiase 92/9 

Nm2!37. hypothetical protein within a region of low GHl % 1 1/14', 1 1/42', 96/47 
Nm2l40: conserved hypothetical protein 1/11, 4/13, 66/4 
Nm2l42: possible pcriplasmic hypothetical protein 46/40 
Nm2145 possible morphogcnc 19/45, 23/18, 32/22, 75/47 
Nm2l46 ispA, Mfirnvw-specific antigen protein 9/9 

Nm2151 . //V/V, probable branchcd-chain amino acid aminotransferase 41/31, 66/34 
Nm2155 pinl pilus-assembly protein 2/19, 14/45, 25/19, 67/33, S5/28 
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Nm22l6: possible hydrolase 75/1 1 

Nm2218: possible tRNA/rRNA met by Itransf erase 1/15 

Nm2223: probable ABC transporter 16/9 

Nm2225: xsett, exodcoxyribonuc lease small subunit 42/37 



Figure 2A 
(cntd) 



Nml 88 or Nm205: ~^/f,glin^ 35/36', 35/39', 37/2', 58/8 

Nm293 or Nm609: pi!C2 or pilC /, pilus-associated protein 5/42 
Nm.525 or Nm527: l$B. lacto-N-neotetraose biosynthesis glycosyl tranferase 3/39 
Nm690 or Nm692: unknown 91/33 

Nm788 or Nml626: 6p4 (rjfa&tiam 3/17, 4/36', 7/36', 9/1, 13/41, 18/36, 19/7, 21/17", 24/17", 27/28, 65/27, 
67/22 

Nml 167 or Nm 1999: possible phage protein 12/37, 18/37 

Nrn 1 1 69 or Nm200 1 : unknown 78/44 

Nml 196 or Nm 13 10: unknown 8/44 

Nrn 1 1 86 or Nm 1 30 i : unknown 49/23, 56/37 

NrnI313 or Nrnl852: unknown 20/16 

Nm 1 780 or Nm 1 78 1 : doubtful ORFs 94/44 



NmB65: hypothetical protein 61/44, 77/4 

NmB69: siaB, polysialic acid capsule biosynthesis protein 28/22 
NmB70: synX, polysialic acid capsule biosynthesis protein 77/19 
NmJB91: hypothetical protein i/4 

NmB98: ABC transporter, ATP-binding protein 7/23, 65/6, 69/1 
NmB223: hypothetical protein 30/6 
NmB372: hypothetical protein 84/20 

NmB493: hem agglutinin/hemolysin -related protein 11/1, 12/1 

NmB648: hypothetical protein 5/40 

NmB653: MafB-related protein 12/21 

N"mB896: integrase 16/22, 43/15, 94/14 

NmBl502: hypothetical protein 4/3, 76/31 

NmB1603 tellurite resistance protein 28/10 

NmB1759: conserved hypothetical protein 9/10 

NmBl768: hemagglutinin/hemolysin-related protein 67/27 

NmB1771: hypothetical protein 29/41, 

NmBI772: hypothetical protein 16/25 

Nm£ 1829: Ton B -dependent receptor 4/32, 47/2 

NmBI830: putative phosphoglycolatc phosphatase 4/10 

NmB!854: hypothetical protein 43/48 

NmBI926: fc/£", lacto-N-neotetraose biosynthesis glycosyl trans fenise 79/25 
NmB2l32: trans ferr in-binding protein-related protein 93/5 

NmB497 or N m B 1 77 9 : he ma ggi ut i n i n/ho mo ly s i n - re 1 ated protein 14/26. 17/21 
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Nm2156: pilD, prcpilin leader peptidase 4/16 
Nm2158: conserved hypothetical protein 1/10 
Nm21 59: pitF, type IV pilus assembly protein 73/27 
Nm2167: hypothetical protein 1/12 

Nm2168: emrB y multidrug resistance translocase 33/39, 57/48 
Nm2171: conserved hypotheticaJ inner membrane protein 4/35, 6/20 
Nm2172: conserved hypothetical protein, possible peptidase 6/38 
Nm2t74: conserved hypotheticaJ protein 6/41 
Nm2175: vapA, possible viailence associated protein 29/25 
Nm2178: conserved hypotheticaJ protein 2/48 

Nm2180: aroG, phospho-2-dehydro-3-deoxyheptonate aldolase 78/10 
Nm2 1 83: opcB y outer membrane protein 32/30 
Nm2i86: hypothetical protein 19/22 
Nm2I89: hypothetical inner membrane protein 2/6', 2/10' 
Nm2193: possible TonB-dependent receptor protein 71/23 

Nm2200: dinG, probable ATP-dependent DNA helicase 83/44 
Nm2204: hypothetical protein 1/23, 3/12 
Nm22I2: trpC, indole-3 -glycerol phosphate synthase 83/13 
Nm22 1 6: possible hydrolase 75/1 1 
Nm2218: possible tRNA/rRNA methyltransferase 1/15 
Nm2223: probable ABC transporter 16/9 
Nm2225: xseB, exodeoxyribonuclease small subunit 42/37 

Nml88 or Nm205: ^igbccse-Iishaspha^ 35/36*, 35/39', 37/2', 58/8 

Nm293 or Nm609: pilC2 or pilCl, pilus-associated protein 5/42 
Nm5?5 or Nm527: /g/ff. lacto-N -neotetraose biosynthesis olvnnqvl tranferase 3/39 
Nm690 or Nm692: unknown 91/33 

Nm788 or Nmi626: tipA <xjtpC y exotoxin 3/17, 4/36', 7/36*, 9/1, 13/41, 18/36, 19/7, 21/17", 24/17", 27/28, 65/27, 
67/22 

Nmi 167 or NmI999: possible phage protein 12/37, 18/37 

Nm 11 69 or Nm200 i : unknown 78/44 

Nml 196 or Nmi 3 10: unknown 8/44 

Nml 186 or Nml 301: unknown 49/23, 56/37 

Nmi 3 13 or Nml 852: unknown 20/16 

Nml 780 or Nml 781: doubtful ORFs 94/44 

NmB65: hypothetical protein 61/44. 77/4 

NmB69: siaB y polysialic acid capsule biosynthesis protein 28/22 
NmB70: sytiX, polysialic acid capsule biosynthesis protein 77/1 9 
NmB91: hypothetical protein 1/4 

NmB9S: ABC transporter, ATP-binding protein 7/23, 65/6, 69/1 
NmB223: hypothetical protein 30/6 
NmB372: hypothetical protein 84/20 

NniB493: hemaggIutiniii/hemo!ysin-related protein 1 I/I, 12/1 

NmB648: hypothetical protein 5/40 

NmB653: MarB-related protein 12/21 

NmB896: integrase 16/22, 43/15, 94/14 

NmB 1502: hypothetical protein 4/3, 76/3! 

NmB 1603 tellurite resistance protein 28/10 

NmB1759: conserved hypothetical protein 9/10 

NmB1768: hemagglulinin/hemolysin-related protein 67/27 

NmB 177 1: hypothetical protein 29/41 

NrnB1772: hypothetical protein 16/25 

NrnB 1 829: TonB-dependent receptor 4/32, 47/2 

NmB 1 830: putative phosphogiycolate phosphatase 4/10 

NmB 1 854: hypothetical protein 43/48 

NmB2l32: iransfcrrin-bindina protein-related protein 93/5 

NmB 497 or NmB 1779: nemaggluiinin/hemolysin-reiated protein 14/26, 17/21 



Figure 2A 
(cntd) 
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List of genes important for resistance to the bactericidal action of the 

complement. 



Nm83: dxr, probable 1-deoxy-D-xylulose 5-phosphate reductoisomerase 

Hml85: lipB, capsule polysaccharide modification protein 

Nml86: lipA, capsule polysaccharide modification protein 

Nml95: crrD, probable capsule polysaccharide export ATP-binding protein 

just ahead "Nm!97: crrB capsule polysaccharide export inner-membrane protein 

Nml98: err A, capsule polysaccharide export outer membrane protein 

Nm229: unknown 

Nm356: possible transferase 

"Nm524: lgtAv, lacto-N-neotetraose biosynthesis glycosyl tranferase 

Nm525 or Nm527: IgtB or igtB2, iacto-N-neotetraose biosynthesis glycosyl tranferase 

»NmS48: galU, probable UTP-glueose-I-phosphate uridyiyitransferase 

NmlOOl: pgm, phosphoglucornutase 

Nml034: rfaE\ probable ADP-heptose synthetase 

Nml037: rfaD, probable ADP-L-glycero-mannoheptose epimerase 

Nir. 1958: !gtF. G- M-giu cosy 1 transferase 

25 bp ahead Nm2135: sugar isomerase 

Genes only present in serogroup B 

NmB1771: hypothetical protein 
NmB65: hypothetical protein 

NmB1926: IgtE, lacto-N-neotecraose biosynthesis glycosyl transferase 
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